Poisson distribution

Poisson
Probability mass function The horizontal axis is the index k, the number of occurrences. The function is only defined at integer values of k. The connecting lines are only guides for the eye.
Cumulative distribution function The horizontal axis is the index k, the number of occurrences. The CDF is discontinuous at the integers of k and flat everywhere else because a variable that is Poisson distributed only takes on integer values.
Notation	$\mathrm{Pois}(\lambda)\,$
Parameters	λ > 0 (real)
Support	k ∈ { 0, 1, 2, 3, ... }
PMF	$\frac{\lambda^k}{k!}\cdot e^{-\lambda}$
CDF	$\frac{\Gamma(\lfloor k%2B1\rfloor, \lambda)}{\lfloor k\rfloor�!}\!$ for $k\ge 0$ or $e^{-\lambda} \sum_{i=0}^{k} \frac{\lambda^i}{i!}\$ (where $\Gamma(x, y)\,\!$ is the Incomplete gamma function and $\lfloor k\rfloor$ is the floor function)
Mean	$\lambda\,\!$
Median	$\approx\lfloor\lambda%2B1/3-0.02/\lambda\rfloor$
Mode	$\lceil\lambda\rceil - 1$
Variance	$\lambda\,\!$
Skewness	$\lambda^{-1/2}\,$
Ex. kurtosis	$\lambda^{-1}\,$
Entropy	$\lambda[1\!-\!\log(\lambda)]\!%2B\!e^{-\lambda}\sum_{k=0}^\infty \frac{\lambda^k\log(k!)}{k!}$ (for large $\lambda$ ) $\frac{1}{2}\log(2 \pi e \lambda) - \frac{1}{12 \lambda} - \frac{1}{24 \lambda^2} -$ $\frac{19}{360 \lambda^3} %2B O(\frac{1}{\lambda^4})$
MGF	$\exp(\lambda (e^{t}-1))\,$
CF	$\exp(\lambda (e^{it}-1))\,$

In probability theory and statistics, the Poisson distribution (pronounced [pwasɔ̃]) (or Poisson law of small numbers^[1]) is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time and/or space if these events occur with a known average rate and independently of the time since the last event. (The Poisson distribution can also be used for the number of events in other specified intervals such as distance, area or volume.)

History

The distribution was first introduced by Siméon Denis Poisson (1781–1840) and published, together with his probability theory, in 1837 in his work Recherches sur la probabilité des jugements en matière criminelle et en matière civile (“Research on the Probability of Judgments in Criminal and Civil Matters”).^[2] The work focused on certain random variables N that count, among other things, the number of discrete occurrences (sometimes called “arrivals”) that take place during a time-interval of given length.

The first practical application of this distribution was done by Ladislaus Bortkiewicz in 1898 when he was given the task to investigate the number of soldiers of the Prussian army killed accidentally by horse kick; this experiment introduced Poisson distribution to the field of reliability engineering.^[3]

Applications

Applications of Poisson distribution can be found in every field related to counting:

Electrical system example: telephone calls arriving in a system.
Astronomy example: photons arriving at a telescope.
Biology example: the number of mutations on a given strand of DNA.

The distribution equation

If the expected number of occurrences in a given interval is λ, then the probability that there are exactly k occurrences (k being a non-negative integer, k = 0, 1, 2, ...) is equal to

$f(k; \lambda)=\frac{\lambda^k e^{-\lambda}}{k!},\,\!$

where

e is the base of the natural logarithm (e = 2.71828...)
k is the number of occurrences of an event — the probability of which is given by the function
k! is the factorial of k
λ is a positive real number, equal to the expected number of occurrences during the given interval. For instance, if the events occur on average 4 times per minute, and one is interested in the probability of an event occurring k times in a 10 minute interval, one would use a Poisson distribution as the model with λ = 10×4 = 40.

As a function of k, this is the probability mass function. The Poisson distribution can be derived as a limiting case of the binomial distribution.

The Poisson distribution can be applied to systems with a large number of possible events, each of which is rare. The Poisson distribution is sometimes called a Poissonian.

Poisson noise and characterizing small occurrences

The parameter λ is not only the mean number of occurrences $E[k]$ , but also its variance $\sigma_k^2=E[k^2]-E[k]^2$ (see Table). Thus, the number of observed occurrences fluctuates about its mean λ with a standard deviation $\sigma_k =\sqrt{\lambda}$ . These fluctuations are denoted as Poisson noise or (particularly in electronics) as shot noise.

The correlation of the mean and standard deviation in counting independent discrete occurrences is useful scientifically. By monitoring how the fluctuations vary with the mean signal, one can estimate the contribution of a single occurrence, even if that contribution is too small to be detected directly. For example, the charge e on an electron can be estimated by correlating the magnitude of an electric current with its shot noise. If N electrons pass a point in a given time t on the average, the mean current is $I=eN/t$ ; since the current fluctuations should be of the order $\sigma_I=e\sqrt{N}/t$ (i.e., the standard deviation of the Poisson process), the charge $e$ can be estimated from the ratio $\sigma_I^2/I$ . An everyday example is the graininess that appears as photographs are enlarged; the graininess is due to Poisson fluctuations in the number of reduced silver grains, not to the individual grains themselves. By correlating the graininess with the degree of enlargement, one can estimate the contribution of an individual grain (which is otherwise too small to be seen unaided). Many other molecular applications of Poisson noise have been developed, e.g., estimating the number density of receptor molecules in a cell membrane.

$\Pr(N_t=k) = f(k;\lambda t) = \frac{e^{-\lambda t} (\lambda t)^k}{k!}$

Related distributions

If $X_1 \sim \mathrm{Pois}(\lambda_1)\,$ and $X_2 \sim \mathrm{Pois}(\lambda_2)\,$ are independent, then the difference $Y = X_1 - X_2$ follows a Skellam distribution.
If $X_1 \sim \mathrm{Pois}(\lambda_1)\,$ and $X_2 \sim \mathrm{Pois}(\lambda_2)\,$ are independent, and $Y = X_1 %2B X_2$ , then the distribution of $X_1$ conditional on $Y=y$ is a binomial. Specifically, $X_1|(Y=y) \sim \mathrm{Binom}(y, \lambda_1/(\lambda_1%2B\lambda_2))\,$ . More generally, if X₁, X₂,..., X_n are independent Poisson random variables with parameters λ₁, λ₂,..., λ_n then

$X_i \left|\sum_{j=1}^n X_j\right. \sim \mathrm{Binom}\left(\sum_{j=1}^nX_j,\frac{\lambda_i}{\sum_{j=1}^n\lambda_j}\right)$

The Poisson distribution can be derived as a limiting case to the binomial distribution as the number of trials goes to infinity and the expected number of successes remains fixed — see law of rare events below. Therefore it can be used as an approximation of the binomial distribution if n is sufficiently large and p is sufficiently small. There is a rule of thumb stating that the Poisson distribution is a good approximation of the binomial distribution if n is at least 20 and p is smaller than or equal to 0.05, and an excellent approximation if n ≥ 100 and np ≤ 10.^[4]
For sufficiently large values of λ, (say λ>1000), the normal distribution with mean λ and variance λ (standard deviation $\sqrt{\lambda}$ ), is an excellent approximation to the Poisson distribution. If λ is greater than about 10, then the normal distribution is a good approximation if an appropriate continuity correction is performed, i.e., P(X ≤ x), where (lower-case) x is a non-negative integer, is replaced by P(X ≤ x + 0.5).

$F_\mathrm{Poisson}(x;\lambda) \approx F_\mathrm{normal}(x;\mu=\lambda,\sigma^2=\lambda)\,$

Variance-stabilizing transformation: When a variable is Poisson distributed, its square root is approximately normally distributed with expected value of about $\sqrt \lambda$ and variance of about 1/4.^[5] Under this transformation, the convergence to normality is far faster than the untransformed variable. Other, slightly more complicated, variance stabilizing transformations are available,^[6] one of which is Anscombe transform. See Data transformation (statistics) for more general uses of transformations.
If the number of arrivals in any given time interval $[0,t]$ follows the Poisson distribution, with mean = $\lambda t$ , then the lengths of the inter-arrival times follow the Exponential distribution, with mean $1/\lambda$ .

Occurrence

The Poisson distribution arises in connection with Poisson processes. It applies to various phenomena of discrete properties (that is, those that may happen 0, 1, 2, 3, ... times during a given period of time or in a given area) whenever the probability of the phenomenon happening is constant in time or space. Examples of events that may be modelled as a Poisson distribution include:

The number of soldiers killed by horse-kicks each year in each corps in the Prussian cavalry. This example was made famous by a book of Ladislaus Josephovich Bortkiewicz (1868–1931).
The number of yeast cells used when brewing Guinness beer. This example was made famous by William Sealy Gosset (1876–1937).^[7]
The number of phone calls arriving at a call centre per minute.
The number of goals in sports involving two competing teams.
The number of deaths per year in a given age group.
The number of jumps in a stock price in a given time interval.
Under an assumption of homogeneity, the number of times a web server is accessed per minute.
The number of mutations in a given stretch of DNA after a certain amount of radiation.
The proportion of cells that will be infected at a given multiplicity of infection.

How does this distribution arise? — The law of rare events

In several of the above examples—such as, the number of mutations in a given sequence of DNA—the events being counted are actually the outcomes of discrete trials, and would more precisely be modelled using the binomial distribution, that is

$X \sim \textrm{B}(n,p). \,$

In such cases n is very large and p is very small (and so the expectation np is of intermediate magnitude). Then the distribution may be approximated by the less cumbersome Poisson distribution

$X \sim \textrm{Pois}(np). \,$

This is sometimes known as the law of rare events, since each of the n individual Bernoulli events rarely occurs. The name may be misleading because the total count of success events in a Poisson process need not be rare if the parameter np is not small. For example, the number of telephone calls to a busy switchboard in one hour follows a Poisson distribution with the events appearing frequent to the operator, but they are rare from the point of view of the average member of the population who is very unlikely to make a call to that switchboard in that hour.

Proof

We will prove that, for fixed $\lambda$ , if

$X_n \sim \textrm{B}(n,\lambda /n); \qquad Y\sim\textrm{Pois}(\lambda). \,$

then for each fixed k

$\lim_{n\to\infty}P(X_n=k) = P(Y=k)$ .

To see the connection with the above discussion, for any Binomial random variable with large n and small p set $\lambda=np$ . Note that the expectation $E(X_n)=\lambda$ is fixed with respect to n.

First, recall from calculus

$\lim_{n\to\infty}\left(1-{\lambda \over n}\right)^n=e^{-\lambda},$

then since $p = \lambda/n$ in this case, we have

$\begin{align} \lim_{n\to\infty} P(X_n=k)&=\lim_{n\to\infty}{n \choose k} p^k (1-p)^{n-k} \\ &=\lim_{n\to\infty}{n! \over (n-k)!k!} \left({\lambda \over n}\right)^k \left(1-{\lambda\over n}\right)^{n-k}\\ &=\lim_{n\to\infty} \underbrace{\left[\frac{n!}{n^k\left(n-k\right)!}\right]}_{A_n} \left(\frac{\lambda^k}{k!}\right) \underbrace{\left(1-\frac{\lambda}{n}\right)^n}_{\to\exp\left(-\lambda\right)} \underbrace{\left(1-\frac{\lambda}{n}\right)^{-k}}_{\to 1} \\ &= \left[ \lim_{n\to\infty} A_n \right] \left(\frac{\lambda^k}{k!}\right)\exp\left(-\lambda\right) \end{align}$

Next, note that

$\begin{align} A_n &= \frac{n!}{n^k\left(n-k\right)!}\\ &= \frac{n\cdot (n-1)\cdots \big(n-(k-1)\big)}{n^k}\\ &= 1\cdot(1-\tfrac{1}{n})\cdots(1-\tfrac{k-1}{n})\\ &\to 1\cdot 1\cdots 1 = 1, \end{align}$

where we have taken the limit of each of the terms independently, which is permitted since there is a fixed number of terms with respect to n (there are k of them). Consequently, we have shown that

$\lim_{n\to\infty}P(X_n=k) = \frac{\lambda^k \exp\left(-\lambda\right)}{k!} = P(Y=k)$ .

Generalization

We have shown that if

$X_n \sim \textrm{B}(n,p_n); \qquad Y\sim\textrm{Pois}(\lambda), \,$

where $p_n=\lambda / n$ , then $X_n\to Y$ in distribution. This holds in the more general situation that $p_n$ is any sequence such that

$\lim_{n\rightarrow\infty} np_n = \lambda.$

2-dimensional Poisson process

Main article: Poisson process

$P(N(D)=k)=\frac{(\lambda|D|)^k e^{-\lambda|D|}}{k!}$

where

e is the base of the natural logarithm (e = 2.71828...)
k is the number of occurrences of an event - the probability of which is given by the function
k! is the factorial of k
D is the 2-dimensional region
|D| is the area of the region
N(D) is the number of points in the process in region D

Properties

The expected value of a Poisson-distributed random variable is equal to λ and so is its variance. The higher moments of the Poisson distribution are Touchard polynomials in λ, whose coefficients have a combinatorial meaning. In fact, when the expected value of the Poisson distribution is 1, then Dobinski's formula says that the nth moment equals the number of partitions of a set of size n.
The mode of a Poisson-distributed random variable with non-integer λ is equal to $\scriptstyle\lfloor \lambda \rfloor$ , which is the largest integer less than or equal to λ. This is also written as floor(λ). When λ is a positive integer, the modes are λ and λ − 1.
Given one event (or any number) the expected number of other events is independent so still λ. If reproductive success follows a Poisson distribution with expected number of offspring λ, then for a given individual the expected number of (half)siblings (per parent) is also λ. If fullsiblings are rare total expected sibs are 2λ.
Sums of Poisson-distributed random variables:

If $X_i \sim \mathrm{Pois}(\lambda_i)\,$ follow a Poisson distribution with parameter $\lambda_i\,$ and $X_i$ are independent, then

$Y = \sum_{i=1}^N X_i \sim \mathrm{Pois}\left(\sum_{i=1}^N \lambda_i\right)\,$

also follows a Poisson distribution whose parameter is the sum of the component parameters. A converse is Raikov's theorem, which says that if the sum of two independent random variables is Poisson-distributed, then so is each of those two independent random variables.

The sum of normalised square deviations is approximately distributed as chi-squared if the mean is of a moderate size ( $\lambda>5$ is suggested).^[8] If $X_1,\dots,X_N$ are observations from independent Poisson distributions with means $\lambda_1,\dots,\lambda_N$ then $\sum_{i=1}^N \frac{(X_i-\lambda_i)^2}{\lambda_i}\sim \chi^2$
The moment-generating function of the Poisson distribution with expected value λ is

$\mathrm{E}\left(e^{tX}\right)=\sum_{k=0}^\infty e^{tk} f(k;\lambda)=\sum_{k=0}^\infty e^{tk} {\lambda^k e^{-\lambda} \over k!} =e^{\lambda(e^t-1)}.$

All of the cumulants of the Poisson distribution are equal to the expected value λ. The nth factorial moment of the Poisson distribution is λⁿ.
The Poisson distributions are infinitely divisible probability distributions.
The directed Kullback-Leibler divergence between Pois(λ) and Pois(λ₀) is given by

$D_{\mathrm{KL}}(\lambda\|\lambda_0) = \lambda_0 - \lambda %2B \lambda \log \frac{\lambda}{\lambda_0}.$

Upper bound for the tail probability of a Poisson random variable $X \sim \text{Pois}(\lambda)$ .^[9] The proof uses a Chernoff bound argument.

$P(X \geq x) \leq \frac{e^{-\lambda} (e \lambda)^x}{x^x}, \text{ for } x > \lambda$

Similarly,

$P(X \leq x) \leq \frac{e^{-\lambda} (e \lambda)^x}{x^x}, \text{ for } x < \lambda$

Evaluating the Poisson distribution

Although the Poisson distribution is limited by

$0 < f(k,\lambda) \le f(\lfloor \lambda \rfloor,\lambda) < 1$ ,

the numerator and denominator of $f(k,\lambda)$ can reach extreme values for large values of $k$ or $\lambda$ .

If the Poisson distribution is evaluated on a computer with limited precision by first evaluating its numerator and denominator and then dividing the two, then a significant loss of precision may occur.

For example, with the common double precision a complete loss of precision occurs if $f(150, 150)$ is evaluated in this manner.

A more robust evaluation method is:

$\begin{align} f(k,\lambda) &= \exp{(\ln{(f(k,\lambda))})} \\ &= \exp{(\ln{(\frac{\lambda^k \exp{(-\lambda)}}{k!})})} \\ &= \exp{(k\ln{(\lambda)} - \lambda - \sum_{i=1}^k \ln{(i)})}. \end{align}$

Generating Poisson-distributed random variables

A simple algorithm to generate random Poisson-distributed numbers (pseudo-random number sampling) has been given by Knuth (see References below):

algorithm poisson random number (Knuth):
    init:
         Let L ← e^−λ, k ← 0 and p ← 1.
    do:
         k ← k + 1.
         Generate uniform random number u in [0,1] and let p ← p × u.
    while p > L.
    return k − 1.

While simple, the complexity is linear in λ. There are many other algorithms to overcome this. Some are given in Ahrens & Dieter, see References below. Also, for large values of λ, there may be numerical stability issues because of the term e^−λ. One solution for large values of λ is Rejection sampling, another is to use a Gaussian approximation to the Poisson.

Inverse transform sampling is simple and efficient for small values of λ, and requires only one uniform random number u per sample. Cumulative probabilities are examined in turn until one exceeds u.

Parameter estimation

Maximum likelihood

Given a sample of n measured values k_i we wish to estimate the value of the parameter λ of the Poisson population from which the sample was drawn. To calculate the maximum likelihood value, we form the log-likelihood function

$\begin{align} L(\lambda) & = \ln \prod_{i=1}^n f(k_i \mid \lambda) \\ & = \sum_{i=1}^n \ln\!\left(\frac{e^{-\lambda}\lambda^{k_i}}{k_i!}\right) \\ & = -n\lambda %2B \left(\sum_{i=1}^n k_i\right) \ln(\lambda) - \sum_{i=1}^n \ln(k_i!). \end{align}$

Take the derivative of L with respect to λ and equate it to zero:

$\frac{\mathrm{d}}{\mathrm{d}\lambda} L(\lambda) = 0 \iff -n %2B \left(\sum_{i=1}^n k_i\right) \frac{1}{\lambda} = 0. \!$

Solving for λ yields a stationary point, which if the second derivative is negative is the maximum-likelihood estimate of λ:

$\widehat{\lambda}_\mathrm{MLE}=\frac{1}{n}\sum_{i=1}^n k_i. \!$

Checking the second derivative, it is found that it is negative for all λ and k_i greater than zero, therefore this stationary point is indeed a maximum of the initial likelihood function:

$\frac{\partial^2 L}{\partial \lambda^2} = -\lambda^{-2}\sum_{i=1}^n k_i$

Since each observation has expectation λ so does this sample mean. Therefore it is an unbiased estimator of λ. It is also an efficient estimator, i.e. its estimation variance achieves the Cramér–Rao lower bound (CRLB). Hence it is MVUE. Also it can be proved that the sample mean is complete and sufficient statistic for λ.

Bayesian inference

In Bayesian inference, the conjugate prior for the rate parameter λ of the Poisson distribution is the Gamma distribution. Let

$\lambda \sim \mathrm{Gamma}(\alpha, \beta) \!$

denote that λ is distributed according to the Gamma density g parameterized in terms of a shape parameter α and an inverse scale parameter β:

$g(\lambda \mid \alpha,\beta) = \frac{\beta^{\alpha}}{\Gamma(\alpha)} \; \lambda^{\alpha-1} \; e^{-\beta\,\lambda} \qquad \text{ for } \lambda>0 \,\!.$

Then, given the same sample of n measured values k_i as before, and a prior of Gamma(α, β), the posterior distribution is

$\lambda \sim \mathrm{Gamma}(\alpha %2B \sum_{i=1}^n k_i, \beta %2B n). \!$

The posterior mean E[λ] approaches the maximum likelihood estimate $\widehat{\lambda}_\mathrm{MLE}$ in the limit as $\alpha\to 0,\ \beta\to 0$ .

The posterior predictive distribution of additional data is a Gamma-Poisson (i.e. negative binomial) distribution.

Confidence interval

A simple and rapid method to calculate an approximate confidence interval for the estimation of λ is proposed in Guerriero et al. (2009). This method provides a good approximation of the confidence interval limits, for samples containing at least 15 – 20 elements. Denoting by N the number of sampled points or events and by L the length of sample line (or the time interval), the upper and lower limits of the 95% confidence interval are given by:

$\lambda_{low}=\frac{(1-\frac{1.96}{\sqrt{N-1}}) N}{L}$

$\lambda_{upp}=\frac{(1%2B\frac{1.96}{\sqrt{N-1}}) N}{L}$

The "law of small numbers"

The word law is sometimes used as a synonym of probability distribution, and convergence in law means convergence in distribution. Accordingly, the Poisson distribution is sometimes called the law of small numbers because it is the probability distribution of the number of occurrences of an event that happens rarely but has very many opportunities to happen. The Law of Small Numbers is a book by Ladislaus Bortkiewicz about the Poisson distribution, published in 1898. Some have suggested that the Poisson distribution should have been called the Bortkiewicz distribution.^[10]

Notes

^ Gullberg, Jan (1997). Mathematics from the birth of numbers. New York: W. W. Norton. pp. 963–965. ISBN 039304002X.
^ S.D. Poisson, Probabilité des jugements en matière criminelle et en matière civile, précédées des règles générales du calcul des probabilitiés (Paris, France: Bachelier, 1837), page 206.
^ Ladislaus von Bortkiewicz, Das Gesetz der kleinen Zahlen [The law of small numbers] (Leipzig, Germany: B.G. Teubner, 1898). On page 1, Bortkiewicz presents the Poisson distribution. On pages 23-25, Bortkiewicz presents his famous analysis of "4. Beispiel: Die durch Schlag eines Pferdes im preussischen Heere Getöteten." (4. Example: Those killed in the Prussian army by a horse's kick.).
^ NIST/SEMATECH, '6.3.3.1. Counts Control Charts', e-Handbook of Statistical Methods, accessed 25 October 2006
^ McCullagh, Peter; Nelder, John (1989). Generalized Linear Models. London: Chapman and Hall. ISBN 0-412-31760-5. page 196 gives the approximation and the subsequent terms.
^ Johnson, N.L., Kotz, S., Kemp, A.W. (1993) Univariate Discrete distributions (2nd edition). Wiley. ISBN 0-471-54897-9, p163
^ Philip J. Boland. "A Biographical Glimpse of William Sealy Gosset". The American Statistician, Vol. 38, No. 3. (Aug., 1984), pp. 179-183.. http://wfsc.tamu.edu/faculty/tdewitt/biometry/Boland%20PJ%20(1984)%20American%20Statistician%2038%20179-183%20-%20A%20biographical%20glimpse%20of%20William%20Sealy%20Gosset.pdf. Retrieved 2011-06-22. "At the turn of the 19th century, Arthur Guinness, Son & Co. became interested in hiring scientists to analyze data concerned with various aspects of its brewing process. Gosset was to be one of the first of these scientists, and so it was that in 1899 he moved to Dublin to take up a job as a brewer at St. James' Gate... Student published 22 papers, the first of which was entitled "On the Error of Counting With a Haemacytometer" (Biometrika, 1907). In it, Student illustrated the practical use of the Poisson distribution in counting the number of yeast cells on a square of a haemacytometer. Up until just before World War II, Guinness would not allow its employees to publish under their own names, and hence Gosset chose to write under the pseudonym of "Student.""
^ Box, Hunter and Hunter. Statistics for experimenters. Wiley. p. 57.
^ Massimo Franceschetti and Olivier Dousse and David N. C. Tse and Patrick Thiran (2007). "Closing the Gap in the Capacity of Wireless Networks Via Percolation Theory". IEEE Transactions on Information Theory. http://circuit.ucsd.edu/~massimo/Journal/IEEE-TIT-Capacity.pdf.
^ Good, I. J. (1986). "Some statistical applications of Poisson's work". Statistical Science 1 (2): 157–180. doi:10.1214/ss/1177013690. JSTOR 2245435.

References

Joachim H. Ahrens, Ulrich Dieter (1974). "Computer Methods for Sampling from Gamma, Beta, Poisson and Binomial Distributions". Computing 12 (3): 223–246. doi:10.1007/BF02293108.
Joachim H. Ahrens, Ulrich Dieter (1982). "Computer Generation of Poisson Deviates". ACM Transactions on Mathematical Software 8 (2): 163–179. doi:10.1145/355993.355997.
Ronald J. Evans, J. Boersma, N. M. Blachman, A. A. Jagers (1988). "The Entropy of a Poisson Distribution: Problem 87-6". SIAM Review 30 (2): 314–317. doi:10.1137/1030059.
V. Guerriero, A. Iannace, S. Mazzoli, M. Parente, S. Vitale, M. Giorgioni (2009). "Quantifying uncertainties in multi-scale studies of fractured reservoir analogues: Implemented statistical analysis of scan line data from carbonate rocks". Journal of Structural Geology (Elsevier). doi:10.1016/j.jsg.2009.04.016.
Donald E. Knuth (1969). Seminumerical Algorithms. The Art of Computer Programming, Volume 2. Addison Wesley.

External links

POISSON() function in most popular spreadsheets

Probability distributions

Discrete univariate with finite support

Benford · Bernoulli · Beta-binomial · binomial · categorical · hypergeometric · Poisson binomial · Rademacher · discrete uniform · Zipf · Zipf-Mandelbrot

Discrete univariate with infinite support

beta negative binomial · Boltzmann · Conway–Maxwell–Poisson · discrete phase-type · extended negative binomial · Gauss–Kuzmin · geometric · logarithmic · negative binomial · parabolic fractal · Poisson · Skellam · Yule–Simon · zeta

Continuous univariate supported on a bounded interval, e.g. [0,1]

Arcsine · ARGUS · Balding-Nichols · Bates · Beta · Noncentral beta · Irwin–Hall · Kumaraswamy · logit-normal · raised cosine · triangular · U-quadratic · uniform · Wigner semicircle

Continuous univariate supported on a semi-infinite interval, usually [0,∞)

Benini · Benktander 1st kind · Benktander 2nd kind · Beta prime · Bose–Einstein · Burr · chi-squared · chi · Coxian · Dagum · Davis · Erlang · exponential · F · Fermi–Dirac · folded normal · Fréchet · Gamma · generalized inverse Gaussian · half-logistic · half-normal · Hotelling's T-squared · hyper-exponential · hypoexponential · inverse chi-squared (scaled-inverse-chi-squared) · inverse Gaussian · inverse gamma · Kolmogorov · Lévy · log-Cauchy · log-Laplace · log-logistic · log-normal · Maxwell–Boltzmann · Maxwell speed · Mittag–Leffler · Nakagami · noncentral chi-squared · Pareto · phase-type · Rayleigh · relativistic Breit–Wigner · Rice · Rosin–Rammler · shifted Gompertz · truncated normal · type-2 Gumbel · Weibull · Wilks' lambda

Continuous univariate supported on the whole real line (−∞, ∞)

Cauchy · exponential power · Fisher's z · generalized normal · generalized hyperbolic · geometric stable · Gumbel · Holtsmark · hyperbolic secant · Landau · Laplace · Linnik · logistic · noncentral t · normal (Gaussian) · normal-inverse Gaussian · skew normal · slash · stable · Student's t · type-1 Gumbel · variance-gamma · Voigt

Continuous univariate with support whose type varies

generalized extreme value · generalized Pareto · Tukey lambda · q-Gaussian · q-exponential · shifted log-logistic

Mixed continuous-discrete univariate distributions

rectified Gaussian

Multivariate (joint)

Discrete: Ewens · multinomial · multivariate Pólya · negative multinomial Continuous: Dirichlet · Generalized Dirichlet · multivariate normal · Multivariate stable · multivariate Student · normal-scaled inverse gamma · normal-gamma Matrix-valued: inverse-Wishart · matrix normal · Wishart

Directional

Univariate (circular) directional: Circular uniform · univariate von Mises · wrapped normal · wrapped Cauchy · wrapped exponential · wrapped Lévy Bivariate (spherical): Kent Bivariate (toroidal): bivariate von Mises Multivariate: von Mises–Fisher · Bingham

Degenerate and singular

Degenerate: discrete degenerate · Dirac delta function Singular: Cantor

Families

Circular · compound Poisson · elliptical · exponential · natural exponential · location-scale · maximum entropy · mixture · Pearson · Tweedie · wrapped

Some common univariate probability distributions

Continuous	beta Cauchy chi-squared exponential F gamma Laplace log-normal normal Pareto Student's t uniform Weibull

Discrete	Bernoulli binomial discrete uniform geometric hypergeometric negative binomial Poisson

List of probability distributions